This invention relates to protein fragments containing epitopic regions of pneumococcal surface protein
A (PspA), the major virulence factor of Streptococcus pneumoniae.

Streptococcus pneumoniae is an important cause of otitis media, meningitis, bacteremia and pneumonia. 
Despite the use of antibiotics and vaccines, the prevalence of pneumococcal infections has declined little over
the last twenty-five years.

It is generally accepted that immunity to Streptococcus pneumoniae can be mediated by specific antibodies
against the polysaccharide capsule of the pneumococcus. However, neonates and young children fail to
make an immune response against polysaccharide antigens and can have repeated infections involving the
same capsular serotype.

One approach to immunising infants against a number of encapsulated bacteria is to conjugate the capsular
polysaccharide antigens to proteins to make them immunogenic. This approach has been successful,
for example, with Haemophilus influenzae b (see U.S. Patent No. 4,496,538 to Gordon and U.S. Patent No.
4,673,574 to Anderson). However, there are over eighty known capsular serotypes of S. pneumoniae, of which
twenty-three account for most of the disease. For a pneumococcal polysaccharide-protein conjugate to be
successful, the capsular types responsible for most pneumococcal infections would have to be made adequately
immunogenic. This approach may be difficult, because the twenty-three polysaccharides included in the
presently-available vaccine are not all adequately immunogenic, even in adults. Furthermore, such a vaccine
would probably be much more expensive to produce than any of the other childhood vaccines in routine use.

An alternative approach for protecting children, and also the elderly, from pneumococcal infection would
be to identify protein antigens that could elicit protective immune responses. Such proteins may serve as a
vaccine by themselves, may be used in conjunction with successful polysaccharide-protein conjugates, or may
serve as carriers for polysaccharides.

In McDaniel et al (I), J. Exp. Med. 160:386-397, 1984, there is described the production of hybridoma
antibodies that recognize cell surface proteins on S. pneumoniae and protection of mice from infection with certain
strains of encapsulated pneumococci by such antibodies. This surface protein antigen has been termed
"pneumococcal surface protein A" or PspA for short.

In McDaniel et al (II), Microbial Pathogenesis 1:519-531, 1986, there are described studies on the
characterization of the PspA. From the results of McDaniel (II), McDaniel (III), J. Exp. Med. 165:381-394, 1987,
Waltman et al., Microb. Pathog. 8:61-69, 1990 and Crain et al., Infect. Immun. 58:3293-3299, 1990, it was also
apparent that the PspAs of different strains frequently exhibit considerable diversity in terms of their epitopes
and apparent molecular weight.

In McDaniel et al (III), there is disclosed that immunization of X-linked immunodeficient (XID) mice with
non-encapsulated pneumococci expressing PspA, but not isogenic pneumococci lacking PspA, protects mice
from subsequent fatal infection with pneumococci.

In McDaniel et al (IV), Infect. Immun. 59:222-228, 1991, there is described immunization of mice with a
recombinant full length fragment of PspA that is able to elicit protection against pneumococcal strains of
capsular types 6A and 3.

In Crain et al (supra), there is described a rabbit anti-serum that detects PspA in 100% (n = 95) of clinical
and laboratory isolates of strains of S. pneumoniae. When reacted with seven monoclonal antibodies to PspA,
fifty-seven S. pneumoniae isolates exhibited thirty-one different patterns of reactivity. Accordingly, although
a large number of serologically-different PspAs exist, there are extensive cross-reactions between PspAs.

The PspA protein type is independent of capsular type. It would seem that genetic mutation or exchange
in the environment has allowed for the development of a large pool of strains which are highly diverse with
respect to capsule, PspA, and possibly other molecules with variable structures. Variability of PspAs from
different strains also is evident in their molecular weights, which range from 67 to 99 kD. The observed
differences are stably inherited and are not the result of protein degradation.

Immunization with a partially purified PspA from a recombinant λgt11 clone elicited protection against
challenge with several S. pneumoniae strains representing different capsular and PspA types, as described
in McDaniel et al (IV), Infect. Immun. 59:222-228, 1991. Although clones expressing PspA were constructed
according to that paper, the product was insoluble and isolation from cell fragments following lysis was not
possible.

While the protein is variable in structure between different pneumococcal strains, numerous cross-reactions
exist between all PspAs, suggesting that sufficient common epitopes may be present to allow a single
PspA or at least a small number of PspAs to elicit protection against a large number of S. pneumoniae
strains.

In addition to the published literature specifically referred to above, the inventors, in conjunction with
co-workers, have published further details concerning PspAs, as follows:

1. Abstracts of 89th Annual Meeting of the American Society for Microbiology, p.125, item D-257, May 1989;

2. Abstracts of 90th Annual Meeting of the American Society for Microbiology, p.98, item D-106, May 1990;

3. Abstracts of 3rd International ASM Conference on Streptococcal Genetics, p.11, item 12, June 1990;

4. Talkington et al, Infect. Immun. 59:1285-1289, 1991;

5. Yother et al (I), J. Bacteriol. 174:601-609, 1992;

6. Yother et al (II), J. Bacteriol. 174:610-618, 1992; and

7. McDaniel et al (V), Microbial Pathogenesis, 13:261-268.

In the co-pending United States patent applications Serial Nos. 656,773 and 835,698 (corresponding to
published WO 92/14488) as well as in Yother et al (I) and (II), there are described the preparation of mutants
of S. pneumoniae that secrete an immunogenic truncated form of the PspA protein, and the isolation and
purification of the secreted protein. The truncated form of PspA was found to be immunoprotective and to contain
the protective epitopes of PspA. The PspA protein described therein is soluble in physiologic solution and lacks
at least the functional cell membrane anchor region.

In the specification which follows and the drawings accompanying the same, there are utilized certain
accepted abbreviations with respect to the amino acids represented thereby. The following Table I identifies
those abbreviations:



TABLE I

AMINO ACID ABBREVIATIONS

A = Ala = Alanine            M = Met = Methionine
C = Cys = Cysteine           N = Asn = Asparagine
D = Asp = Aspartic Acid      P = Pro = Proline
E = Glu = Glutamic Acid      Q = Gln = Glutamine
F = Phe = Phenylalanine      R = Arg = Arginine
G = Gly = Glycine            S = Ser = Serine
H = His = Histidine          T = Thr = Threonine
I = Ile = Isoleucine         V = Val = Valine
K = Lys = Lysine             W = Trp = Tryptophan
L = Leu = Leucine            Y = Tyr = Tyrosine



In accordance with the present invention, there has been identified a 68-amino acid region of PspA from
the Rx1 strain of Streptococcus pneumoniae which not only contains protection-eliciting epitopes, but also is
sufficiently cross-reactive with other PspAs from other S. pneumoniae strains so as to be a suitable candidate
for the region of PspA to be incorporated into a recombinant PspA vaccine.

The 68-amino acid sequence extends from amino acid residues 192 to 260 of the Rx1 PspA protein. While
the disclosure herein refers particularly to the specific 68 amino acid sequence of the Rx1 PspA protein, any
region of a PspA protein from any other S. pneumoniae strain which is homologous to this sequence of the
Rx1 PspA protein is included within the scope of the invention, for example, from strains D39 and R36A.

Accordingly, in one aspect, the present invention provides an isolated pneumococcal surface protein A
(PspA) protein fragment comprising amino acid residues corresponding to all or some of amino acid residues
192 to 260 of the PspA protein of the Rx1 strain of Streptococcus pneumoniae containing at least one
protection-eliciting epitope and optionally up to a further 25 residues of said protein in the NH2 terminal direction
and/or the COOH terminal direction, or being effectively homologous with such a protein fragment.

The protein fragment may be one containing an amino acid sequence corresponding to or homologous to
the amino acid residues 192 to 260 of the PspA protein of the Rx1 strain and hence may comprise fragments
larger or smaller than ones containing the specific amino acid sequence.

The protein fragment of the invention may be produced recombinantly in the form of a truncated C-terminal-deleted
product containing the protein fragment, specifically a truncated C-terminal-deleted product containing
the approximately C-terminal third of an α-helical region of the native PspA protein.

The amino acid sequence of the protein fragment need not be that found in strain Rx1 but can be based
on a corresponding sequence from another strain. Thus, the present invention also includes an isolated protein
fragment comprising an amino acid sequence corresponding to that of a protection-eliciting epitope contained in
amino acid residues 192 to 260 of the PspA protein of the Rx1 strain of Streptococcus pneumoniae.

In particular, the invention includes an isolated protein fragment comprising the amino acid sequence of
or effectively homologous with that of a protection-eliciting epitope corresponding to an epitope contained in
amino acid residues 192 to 260 of the pneumococcal surface protein A (PspA) protein of the Rx1 strain of
Streptococcus pneumoniae, and including no more than 25 additional amino acid residues in the NH2 and/or the
COOH terminal direction.
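
The residue bookkeeping implied by this definition can be sketched in Python. This is an illustration only and forms no part of the disclosed method: the function names `fragment_bounds` and `extract` and their parameters are hypothetical, positions are taken to be 1-based and inclusive, and only the core region (192 to 260), the protein length (588) and the 25-residue flank limit come from the text.

```python
def fragment_bounds(core_start=192, core_end=260, n_flank=0, c_flank=0,
                    protein_length=588, max_flank=25):
    """Return the 1-based inclusive (start, end) of a fragment made of the
    core region plus optional flanking residues on either side."""
    if not (0 <= n_flank <= max_flank and 0 <= c_flank <= max_flank):
        raise ValueError("each flank must be between 0 and %d residues" % max_flank)
    # Clamp to the ends of the protein so the fragment never runs off it.
    start = max(1, core_start - n_flank)
    end = min(protein_length, core_end + c_flank)
    return start, end


def extract(sequence, start, end):
    """Slice a sequence string using 1-based inclusive coordinates."""
    return sequence[start - 1:end]


if __name__ == "__main__":
    print(fragment_bounds())                        # core region only
    print(fragment_bounds(n_flank=25, c_flank=25))  # maximum flanks on both sides
```

Any real use would supply the deduced Rx1 PspA sequence of Figure 1 as `sequence`; no sequence data are assumed here.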

The invention includes a vaccine containing a protein fragment of the invention. It also includes certain
DNA primers or probes described herein.

The invention will be further described with reference to the following drawings, in which:

Figure 1 contains the DNA sequence for the pspA gene of the Rx1 strain of S. pneumoniae with the deduced
amino acid sequence for the PspA protein;
Figure 2 contains a schematic representation of the domains of the mature PspA protein as well as identification
of certain plasmids containing gene sequences coding for the full length protein (pKSD1014), coding
for specific segments of the N-terminal portion of the protein (pJY4284 or pJY4285, pJY4310,
pJY4306) and coding for specific sequences of the C-terminal region of the protein (pBC207, pBC100);
Figure 3 contains a schematic representation of the domains of the mature PspA protein and the general
location of epitopes recognised by certain monoclonal antibodies; and
Figure 4 is an immunoblot of PspA protein gene products produced by plasmids identified therein.

As described in the prior U.S. patent applications referred to above and in Yother et al (I) and (II), the pspA
gene of strain Rx1 encodes a 65 kDa molecule composed of 588 amino acids. The nucleotide sequence (SEQ
ID No: 1) of the pspA gene and derived amino acid sequence (SEQ ID No: 2) are set forth in Figure 1. The
N-terminal half of the molecule is highly charged and its DNA sequence predicts an α-helical coiled-coil protein
structure for this region (288 amino acids), as seen in Figure 2. The C-terminal half of PspA, which is not
α-helical, includes a proline-rich region (83 amino acids) and a repeat region containing the highly conserved
twenty amino acid repeats, as well as a slightly hydrophobic sequence of 17 amino acids at the C-terminus.
It is known that PspA is anchored to S. pneumoniae by its C-terminal half and it is likely that the proline-rich
region serves to tangle the molecule in the cell wall. In addition, it is anticipated that the highly-charged α-helical
region begins at the cell wall and extends into and possibly through the capsule. This model is supported by
the observation that the α-helical domain contains all the surface exposed epitopes recognized by monoclonal
antibodies (MAbs) reactive with PspA on the pneumococcal surface.

The PspA protein of S. pneumoniae strain Rx1 has been mapped to locate protection-eliciting epitopes.
Such mapping has been effected by employing antibodies to PspA protein and recombinant fragments of PspA.
This mapping technique, described in detail in the Examples below, has identified an amino acid sequence
corresponding to the C-terminal third of the α-helical region of PspA as containing protection-eliciting epitopes,
specifically the amino acid residues 192 to 260 of the Rx1 PspA protein. The amino acid sequence from residues
192 to 260 is the C-terminal third of the α-helical sequence, expected to be near the cell wall surface.

Since the portion of the sequence from residues 192 to 260 contains only 68 amino acids, individual PspA
protein fragments of this size may not be optimally antigenic. This difficulty is overcome by producing
recombinant proteins containing tandem fragments of different PspAs expressed by gene fusions of the appropriate
portions of several pspA genes.

Accordingly, in a further aspect of the invention, there is provided a PspA protein fragment comprising a
plurality of conjugated molecules, each molecule comprising amino acid residues 192 to 260 of the PspA protein
of the Rx1 strain of Streptococcus pneumoniae and containing at least one protection-eliciting epitope, each
molecule being derived from a different strain of S. pneumoniae.

Such tandem molecules can be engineered to maintain proper coiled-coil structure at the points of junction
and to be large enough to be immunogenic and to express an array of protection-eliciting epitopes that may
cross-react with a wide spectrum of PspAs. Alternatively, individual recombinantly-produced peptides may be
attached by chemical means to form a complex molecule.

A further alternative is to attach the PspA fragment to a larger carrier protein or bacterial cell, either as a 
recombinant fusion product or through chemical attachment, such as by covalent or ionic attachment. 

The protein fragments and peptide analogs thereof provided herein are useful components of a vaccine
against disease caused by pneumococcal infection. Accordingly, the present invention provides, in a yet further
aspect, a vaccine comprising the PspA protein fragments defined herein as an immunologically-active
component thereof.
BIOLOGICAL MATERIALS

In the Examples which follow, as well as in the accompanying drawings, reference is made to certain plasmid
materials containing whole or truncated pspA gene sequences. The following Table II provides a summary of
such materials:



Table II

Plasmid Identification    Gene              Product

pKSD1014                  whole gene        amino acids 1 to 588
pJY4284 or pJY4285        5'-terminal region    amino acids 1 to 115
pJY4310                   5'-terminal region    amino acids 1 to 192
pJY4306                   5'-terminal region    amino acids 1 to 260
pBC207                    3'-terminal region    amino acids 119 to 588
pBC100                    3'-terminal region    amino acids 192 to 588



EXAMPLES 
Example 1 : 


This Example describes the bacterial strains, plasmids and hybridoma antibodies used herein. 

S. pneumoniae strains, identified in Table III below, were grown in Todd Hewitt broth with 0.5% yeast
extract at 37°C or on blood agar plates containing 3% sheep blood in a candle jar. E. coli strain DH1 (Hanahan,
J. Mol. Biol. 166:557) was grown in LB medium or minimal E medium. Plasmids included pUC18 (Gene 33:103),
pJY4163 (Yother et al (II)), and pIN-III-ompA (EMBO J. 3:2437).

All antibody-secreting hybridoma lines were obtained by fusions with non-antibody-secreting myeloma cell
line P3-X63-Ag.8.653 (J. Immunol. 123:1548). The specific antibodies employed are identified in Table III
below. The anti-PspA hybridoma cell lines Xi64, Xi126 and XiR278 have previously been described in McDaniel
et al (I) and Crain et al (supra). The remaining cell lines were prepared by immunising CBA/N mice with
recombinant D39 PspA expressed in λgt11 by the technique described in McDaniel et al (I). The cell lines producing
antibodies to PspA were all identified using an ELISA in which microtitration plates were coated with heat-killed
(60°C, 30 mins) S. pneumoniae R36A or Rx1, which would select for those MAbs that react with surface exposed
epitopes on PspA. The heavy chain isotypes of the MAbs were determined by developing the ELISA with
affinity purified goat antibody specific for μ and γ heavy chains of IgM and IgG mouse immunoglobulin. The
specificity of the MAbs for PspA was confirmed by immunoblot analysis.

All six newly-produced MAbs, identified in Table III as XiR1526, XiR35, XiR1224, XiR16, XiR1325 and
XiR1323, detected a protein of the expected size (apparent molecular weight of 84 kDa) in an immunoblot of
strains Rx1 and D39. No reactivity was observed for any of the MAbs in an immunoblot of strain WG44.1, a
PspA⁻ variant of Rx1 (see McDaniel et al (III) and Yother et al (II)).

Example 2 :

This Example describes the provision of the pspA gene from pneumococcal strain Rx1 by polymerase chain
reaction (PCR).

PCR primers were designed based on the sequence of the pspA gene from pneumococcal strain Rx1 (see
Figure 1). The 5'-primers were LSM3 and LSM4. LSM3 was 28 bases in length and started at base 576 and
LSM4 was 31 bases in length and started at base 792, and both contained an additional BamHI site. The
3' pspA primer was LSM2, which was 33 bases in length, started at base 1990 and contained an additional
SalI site.

The nucleotide sequences for the primers are as follows:

LSM2 5'-GCGCGTCGACGGCTTAAACCCATTCACCATTGG-3' (SEQ ID NO: 3)

LSM3 5'-CCGGATCCTGAGCCAGAGCAGTTGGCTG-3' (SEQ ID NO: 4)

LSM4 5'-CCGGATCCGCTCAAAGAGATTGATGAGTCTG-3' (SEQ ID NO: 5)
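
The stated primer lengths and added restriction sites can be cross-checked against the sequences with a short Python sketch. The primer sequences and lengths are those given above; the recognition sequences GGATCC (BamHI) and GTCGAC (SalI) are the standard ones for those enzymes, and the names `PRIMERS`, `SITES`, `EXPECTED` and `check_primer` are hypothetical, introduced only for this illustration.

```python
# Primer sequences (5' to 3') as listed in this Example.
PRIMERS = {
    "LSM2": "GCGCGTCGACGGCTTAAACCCATTCACCATTGG",
    "LSM3": "CCGGATCCTGAGCCAGAGCAGTTGGCTG",
    "LSM4": "CCGGATCCGCTCAAAGAGATTGATGAGTCTG",
}

# Standard recognition sequences of the two restriction enzymes.
SITES = {"BamHI": "GGATCC", "SalI": "GTCGAC"}

# Length in bases and added restriction site stated in the text for each primer.
EXPECTED = {
    "LSM2": (33, "SalI"),
    "LSM3": (28, "BamHI"),
    "LSM4": (31, "BamHI"),
}


def check_primer(name):
    """True if the primer has the stated length and contains the stated site."""
    seq = PRIMERS[name]
    length, enzyme = EXPECTED[name]
    return len(seq) == length and SITES[enzyme] in seq


if __name__ == "__main__":
    for name in sorted(PRIMERS):
        print(name, len(PRIMERS[name]), check_primer(name))
```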

Approximately 10 ng of genomic Rx1 pneumococcal DNA was amplified using a 5' and 3' primer pair. The
sample was brought to a total volume of 50 µl containing a final concentration of 50 mM KCl, 10 mM Tris-HCl
(pH 8.3), 1.5 mM MgCl2, 0.001% gelatin, 0.5 mM each primer and 200 mM of each deoxynucleoside
triphosphate and 2.5 U of Taq DNA polymerase. Following overlaying of the samples with 50 µl of mineral oil, the
samples were denatured at 94°C for 2 mins and then subjected to 10 cycles consisting of 1 min. at 94°C, 2
min. at 50°C and 3 min. at 72°C, followed by another 20 cycles of 1 min. at 94°C, 2 min. at 60°C and 3 min. at
72°C. After completion of the 30 cycles, the samples were held at 72°C for an additional 5 min., prior to cooling
to 4°C.
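
The thermal cycling schedule just described can be written down as data and totalled, as in the following Python sketch. The temperatures, hold times and cycle counts are those of the text; the stage labels and the names `PROGRAM`, `total_cycles` and `total_minutes` are hypothetical, and ramp times between temperatures and the final hold at 4°C are ignored.

```python
# Each stage: (label, number of repeats, [(temperature in deg C, minutes), ...]).
PROGRAM = [
    ("initial denaturation", 1, [(94, 2)]),
    ("first 10 cycles", 10, [(94, 1), (50, 2), (72, 3)]),
    ("next 20 cycles", 20, [(94, 1), (60, 2), (72, 3)]),
    ("final hold at 72", 1, [(72, 5)]),
]


def total_cycles(program):
    """Count repeats of the multi-step (three-temperature) stages only."""
    return sum(repeats for _, repeats, steps in program if len(steps) > 1)


def total_minutes(program):
    """Sum of programmed hold times, excluding ramping and the 4 deg C hold."""
    return sum(repeats * sum(minutes for _, minutes in steps)
               for _, repeats, steps in program)


if __name__ == "__main__":
    print(total_cycles(PROGRAM), "cycles,", total_minutes(PROGRAM), "min of holds")
```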

Example 3 : 

This Example describes expression of truncated PspA molecules. 

3'-deleted pspAs that express N-terminal fragments in E. coli and which secrete the same fragments from
pneumococci were constructed as described in the aforementioned U.S. patent applications Serial Nos.
835,698 and 656,773 (see also Yother et al (II), supra).

For expression of 5'-deleted pspA constructs, the secretion vector pIN-III-ompA was used. Amplified pspA
fragments were digested with BamHI and SalI and ligated into the appropriately BamHI/SalI-digested
pIN-III-ompA vector, providing the inserted fragment fused to the ompA leader sequence in frame and under control
of the lac promoter. Transformants of E. coli DH1 were selected on minimal E medium supplemented with
casamino acids (0.1%), glucose (0.2%) and thiamine (0.05 mM) with 50 ng/ml of ampicillin.

For induction of lac expression, bacteria were grown to an optical density of approximately 0.6 at 660 nm
at 37°C in minimal E medium and IPTG was added to a concentration of 2 mM. The cells were incubated for
an additional two hours at 37°C and harvested, and the periplasmic contents were released by osmotic shock. An
immunoblot of the truncated PspA proteins produced by the various plasmids is shown in Figure 4.

By these procedures, there were provided, for the 3'-deleted pspAs, plasmids pJY4284, pJY4285, pJY4310
and pJY4306 and, for the 5'-deleted pspAs, plasmids pBC207 and pBC100. Plasmids pJY4284 and pJY4285
contained an insert of 564 base pairs, nucleotides 1 to 564, and encoded a predicted 13 kDa PspA
C-terminal-deleted product corresponding to amino acids 1 to 115. Plasmid pJY4310 contained an insert of 795 base pairs,
nucleotides 1 to 795, and encoded a predicted 21 kDa C-terminal-deleted product corresponding to amino acids
1 to 192. Plasmid pJY4306 contained an insert of 999 base pairs, nucleotides 1 to 999, and encoded a predicted
29 kDa C-terminal-deleted product corresponding to amino acids 1 to 260. Plasmid pBC100 contained an insert
of 1199 base pairs, nucleotides 792 to 1990, and encoded a predicted 44 kDa PspA N-terminal-deleted product
containing amino acids 192 to 588. Plasmid pBC207 contained an insert of 1415 base pairs, nucleotides 576 to 1990,
and encoded a predicted 52 kDa PspA N-terminal-deleted product containing amino acids 119 to 588.
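
The insert sizes and predicted product sizes quoted above can be checked for internal consistency with a few lines of Python. The nucleotide ranges, amino acid ranges, insert sizes and kDa values are transcribed from the text. The figure of about 110 Da per amino acid residue is an assumption (a common rule of thumb), not a value given in the specification, and the names `CONSTRUCTS`, `span`, `estimated_kda` and `check` are hypothetical.

```python
# For each construct: (nucleotide range of insert, amino acid range of product,
# stated insert size in base pairs, stated predicted mass in kDa).
CONSTRUCTS = {
    "pJY4284/pJY4285": ((1, 564), (1, 115), 564, 13),
    "pJY4310": ((1, 795), (1, 192), 795, 21),
    "pJY4306": ((1, 999), (1, 260), 999, 29),
    "pBC100": ((792, 1990), (192, 588), 1199, 44),
    "pBC207": ((576, 1990), (119, 588), 1415, 52),
}

# Assumption: average residue mass of roughly 0.110 kDa (rule of thumb only).
AVG_RESIDUE_KDA = 0.110


def span(first, last):
    """Number of positions in an inclusive range."""
    return last - first + 1


def estimated_kda(aa_range):
    """Rough mass estimate from residue count, rounded to the nearest kDa."""
    return round(span(*aa_range) * AVG_RESIDUE_KDA)


def check(name):
    """True if stated insert size and stated mass agree with the ranges."""
    nt_range, aa_range, stated_bp, stated_kda = CONSTRUCTS[name]
    return span(*nt_range) == stated_bp and estimated_kda(aa_range) == stated_kda


if __name__ == "__main__":
    for name in CONSTRUCTS:
        print(name, check(name))
```

Under that assumed average residue mass, the rounded estimates coincide with the predicted sizes quoted in the text for all five constructs.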

The pspA gene sequences contained in these plasmids code for and express amino acids as identified in 
Figure 2. 

Example 4 :

This Example describes the procedure of effecting immunoassays. 

Immunoblot analysis was carried out as described in McDaniel et al (IV). The truncated PspA molecules
prepared as described in Example 3 or pneumococcal preparations enriched for PspA (as described in
McDaniel et al (II)) were electrophoresed in a 10% sodium dodecyl sulfate polyacrylamide gel and electro-blotted
onto nitrocellulose. The blots were probed with individual MAbs, prepared as described in Example 1.

A direct binding ELISA procedure was used to quantitatively confirm reactivities observed by immunoblotting.
In this procedure, osmotic shock preparations were diluted to a total protein concentration of 3 ng/ml in
phosphate buffered saline (PBS) and 100 µl was added to wells of Immulon 4 microtitration plates. After
blocking with 1% bovine serum albumin in PBS, unfractionated tissue culture supernates of individual MAbs were
titered in duplicate by 3-fold serial dilution through 7 wells and developed as described in McDaniel et al (IV)
using a goat anti-mouse immunoglobulin alkaline phosphatase-conjugated secondary antibody and alkaline
phosphatase substrate. Plates were read in a Dynatech plate reader at 405 nm, and the 30% end point was
calculated for each antibody with each preparation.
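
As a small illustration of the titration layout, the 3-fold serial dilution through 7 wells can be generated in Python as follows. The fold and the number of wells are from the text; the function name `serial_dilutions` is hypothetical, and treating the first well as undiluted supernate (dilution factor 1) is an assumption, since the starting dilution is not stated.

```python
def serial_dilutions(fold=3, wells=7, start=1):
    """Return the dilution factor in each successive well.

    `start` is the dilution factor of the first well; 1 (undiluted) is an
    assumption made for illustration only.
    """
    return [start * fold ** i for i in range(wells)]


if __name__ == "__main__":
    # Reciprocal dilutions across the seven wells of one titration row.
    print(serial_dilutions())
```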

The protective capacity of the MAbs was tested by injecting three CBA/N mice i.p. with 0.1 ml of 1/10
dilution (about 5 to 30 µg) of each hybridoma antibody 1 hr prior to i.v. injection of 10^3 CFU of WU2 or D39
pneumococci (>100 x LD50). Protection was judged as the ability to prevent death of all mice in a group. All
non-protected mice died of pneumococcal infection within 48 hours post challenge.

Example 5 : 

This Example describes mapping of the epitopes on PspA using the monoclonal antibodies described in
Example 1.

The six newly-produced monoclonal antibodies described in Example 1 and identified in Table III were used
along with the previously-described monoclonal antibodies Xi64, Xi126 and XiR278 to map epitopes on PspA.

To determine whether each of the MAbs recognized different epitopes, each of them was reacted with eight
additional S. pneumoniae strains, as identified in Table III, in immunoblots of SDS-PAGE separated proteins.
Seven different patterns of activity were observed. Three antibodies, XiR16, XiR35 and XiR1526, appeared to
recognize epitopes found on Rx1 PspA but none of the other PspAs. Accordingly, it was possible that these
three antibodies might all react with the same epitope on Rx1 PspA.

MAbs Xi64 and Xi126 both reacted strongly only with epitopes on ATCC 101813, WU2 and Rx1 PspAs, but
not with PspAs of the other strains. However, it is known from studies of larger panels of PspAs (as described
in McDaniel et al (III) and Crain et al) that Xi126 and Xi64 recognize different determinants.

The remaining four antibodies each exhibited unique patterns of reactivity with the panel of PspAs.
Accordingly, the nine antibodies tested recognized at least seven different epitopes on PspA.

For reasons which are not clear, the type 2 strain D39 appeared to be uniquely able to resist the protective
effects of antibodies to PspA (McDaniel et al (IV)). As described in McDaniel et al (I), greater than forty times
the amount of Xi126 was required to passively protect against the D39 strain as compared to the WU2 strain.
None of the six newly-produced monoclonal antibodies protected against the D39 strain. In contrast,
immunization of mice with Rx1 PspA elicits protection against A66, WU2 and EF6796 strains (mouse virulent
pneumococci of capsular types 3, 3 and 6A respectively), all of which have PspA types that are different from those
of Rx1 and D39 (see McDaniel et al (IV)). In view of the close serologic similarity between the type 25 PspA
of Rx1 and type 1 PspA of WU2 (Crain et al), WU2 pneumococci were used to challenge mice that had been
passively protected with the MAbs. All five of the MAbs that were observed to bind WU2 PspA were able to
protect against infection with 1000 CFU of WU2. Protective antibodies were found in IgM, IgG1, IgG2b and IgG2a
heavy chain isotype classes.

Example 6 :

This Example describes mapping of the epitopes of PspA using the recombinant truncated PspA molecules 
formed in Example 3. 

The five overlapping C-terminal- or N-terminal-deleted PspA fragments, prepared as described in Example
3 and shown in Figure 2, were used to map epitopes on PspA. The general location of the epitopes detected
by each of the nine MAbs, as described in Example 5, was determined using the three C-terminal-deleted and
two N-terminal-deleted PspA molecules. As a positive control, the reactivity of each antibody was examined
with a clone, pKSD1014, expressing full-length PspA.

As noted earlier, the reactivity of the MAbs was determined by two methods. In one method, reactivity
between the fragments and MAb was evaluated in immunoblots of the fragment preparations after they had been
separated by SDS-PAGE. In the second method, a direct ELISA was used to quantify the reactivity of the MAbs
with non-denatured PspA fragment.

The reactivities observed and the quantification of such activity are set forth in the following Table IV:

EP 0 622 081 A2

TABLE IV

[Table IV: reactivity of the MAbs with the truncated PspA fragments by immunoblot and ELISA. The table contents are not recoverable from the source text.]

The deduced locations of the epitopes are indicated in Figure 3. 

As can be seen from the data in Table IV, three of the antibodies, Xi126, XiR35 and XiR1526, react
strongly with all three C-terminal-deleted clones in immunoblot analysis, indicating that the sequence required
to form the epitope(s) detected by all three lies within the first 115 amino acids of PspA. This map position is
in agreement with the failure of these antibodies to react with either of the N-terminal-deleted clones that lack
the first 119 and 191 amino acids.

MAb XiR1224 reacted strongly by immunoblot with the longest C-terminal-deleted fragment (pJY4306), but
showed substantially weaker reactions with the shorter two C-terminal-deleted fragments. This result indicates
that, while the binding site of the antibody may be in the first 115 amino acids, residues beyond amino acid
192 may be important for the conformation or stability of the epitope.

By immunoblot, the three antibodies Xi64, XiR1325 and XiR278 all reacted with the longest
C-terminal-deleted fragment and both of the N-terminal-deleted fragments, thus locating their determinants between amino
acid positions 192 and 260. Generally confirmatory results were obtained in ELISAs with the native molecules.
However, in a few cases, reactions were observed in ELISAs with full length PspA but not with a truncated
molecule even though the same truncated fragment was reactive with the antibody by immunoblot. These
observations may have resulted from an altered conformation of the truncated fragments under physiologic
conditions that masked or prevented the formation of determinants present in full-length PspA and in the denatured
fragments.

Two antibodies, XiR16 and XiR1323, showed what at first appeared to be anomalous reactions, indicating
that epitopes detected by the antibodies might be in more than one portion of PspA. In view of this unexpected
result, the assays were repeated multiple times with two sets of preparations of the truncated fragments. The
results of the additional assays confirmed the two-position mapping of epitopes for these two MAbs.

By immunoblot, MAb XiR16 reacted strongly with the two longest C-terminal-deleted fragments and failed
to react with the shortest N-terminal-deleted fragment. Accordingly, the epitope detected must be N-terminal
to position 192. Unexpectedly, MAb XiR16 reacted weakly in immunoblots with both the longest
N-terminal-deleted fragment (residues 119 to 588) and the shortest C-terminal-deleted fragment (residues 1 to 115). Since
the fragments do not overlap, and if the weak immunoblot reactivities with fragments (reactivities not seen
by ELISA) are not an artifact, the MAb XiR16 must recognize epitopes on both fragments.
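
The deletion-mapping reasoning used in this Example can be sketched in Python as an interval intersection. The fragment coordinates are those of Table II; the names `FRAGMENTS` and `map_epitope` are hypothetical. The sketch is a simplification: it treats every reaction as simply positive, ignores reaction strength and the information carried by non-reactive fragments, and assumes a single linear determinant.

```python
# Amino acid ranges (1-based, inclusive) of the truncated PspA products.
FRAGMENTS = {
    "pJY4284/pJY4285": (1, 115),
    "pJY4310": (1, 192),
    "pJY4306": (1, 260),
    "pBC207": (119, 588),
    "pBC100": (192, 588),
}


def map_epitope(reactive):
    """Return the region shared by all reactive fragments, or None.

    None means the reactive fragments have no residues in common, so a
    single determinant cannot account for all of the reactions.
    """
    start = max(FRAGMENTS[name][0] for name in reactive)
    end = min(FRAGMENTS[name][1] for name in reactive)
    return (start, end) if start <= end else None


if __name__ == "__main__":
    # Reactive with the longest C-terminal-deleted fragment and both
    # N-terminal-deleted fragments.
    print(map_epitope(["pJY4306", "pBC207", "pBC100"]))
    # Reactive with all three C-terminal-deleted fragments.
    print(map_epitope(["pJY4284/pJY4285", "pJY4310", "pJY4306"]))
    # Reactive with two fragments that do not overlap.
    print(map_epitope(["pBC207", "pJY4284/pJY4285"]))
```

The first call gives residues 192 to 260, the second gives residues 1 to 115, and the third gives no shared region, which corresponds to the conclusion that more than one epitope must be involved when non-overlapping fragments both react.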

In the case of MAb XiR1323, the immunoblot data clearly places the detected epitope between positions
192 and 260. In the ELISA studies, however, XiR1323 reacted strongly and reproducibly with the
C-terminal-deleted fragment pJY4310 (amino acid residues 1 to 192) as well as the shortest N-terminal-deleted fragment
pBC100 (amino acid residues 192 to 588). Curiously, an ELISA reaction was not observed between MAb
XiR1323 and pJY4306 (amino acid residues 1 to 260), even though MAb XiR1323 reacted strongly with this
fragment by immunoblot.

These findings provide additional evidence for distal conformation effects on antigenic determinants of
PspA. They also indicate that, on the native fragments, MAb XiR1323 sees epitopes on both sides of position
192. The relationship between expression of the epitopes in other PspAs and their position in Rx1 PspA is
demonstrated in Table IV, in which are listed the antibodies in accordance with their apparent map position in PspA.
The five antibodies (including XiR16) that clearly recognize epitopes N-terminal to position 116 are listed at the
left side of Table IV. The four antibodies that clearly recognize epitopes C-terminal to position 192 are listed
on the right side of Table IV. Three of the five epitopes N-terminal of position 192 (those recognized by XiR1526,
XiR35, and XiR16) were not found on any of the other eight PspAs tested. One epitope (recognized by
XiR1224) was weakly expressed by one other strain and another (recognized by Xi126) was expressed on two
other strains. In contrast, the four epitopes present in the C-terminal third of the PspA α-helical region were
each present in from two to six other strains. The greater conservation of the region C-terminal to position
192, as compared to the region N-terminal to position 192, was significant at P<0.05 by both the Chi-square
and the two sample rank tests. Based on the mapping results (Table III) and the strain distribution results (Table
IV), it is apparent that all of the antibodies except possibly XiR35 and XiR1526 must recognize different PspA
determinants.


Example 7 : 

This Example contains a discussion of the mapping results achieved in Example 6.
The results set forth in Example 6 clearly demonstrate that the protection-eliciting epitopes of PspA are
not restricted to the N-terminal end of the surface-exposed α-helical half of the molecule. In fact, four of the
five antibodies protective against S. pneumoniae WU2 reacted with the C-terminal third of the α-helical region
of PspA. This portion of the α-helical region is thought to be closest to the cell wall (see Yother et al (II)).

About half of the MAbs recognized determinants N-terminal to amino acid 115 and the other half recognized
epitopes C-terminal to residue 192. Since the nine antibodies were selected for their ability to bind native PspA
on the surface of heat-killed whole pneumococci, the distribution of the epitopes they recognize suggests that
determinants between positions 115 and 192 are either not immunogenic or are not exposed on the native
molecule as expressed on pneumococci.

Curiously, two MAbs (XiR16 and XiR1323) appeared to react with epitopes in more than one position
on PspA. Although the bulk of the data for XiR16 placed its epitope N-terminal to position 115, weak immunoblot
patterns suggested that a reactive epitope(s) may also exist C-terminal to residue 115. In the case of XiR1323,
the bulk of the data indicated that its epitope is between positions 192 and 260. However, the ELISA assay
showed significant reactivity of the antibody with a C-terminal-deleted PspA fragment extending from residues
1 to 192. Although there are no extensive repeats in the N-terminal half of PspA, there are a few short
sequences that occur more than once in the coiled-coil motif. One such sequence is glu-glu-ala-lys, which
starts at amino acid positions 105, 133, and 147, and another is lys-ala-lys-leu, starting at positions 150 and
220 (see Figure 1). In the case of XiR1323, the antibody reacted with the epitope on the 1 to 192 fragment under
native but not denaturing conditions. This may indicate that the epitope is conformational and may not have
exactly the same sequence as the epitope recognized (under both native and denaturing conditions) between
residues 192 and 260.
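Short repeated sequences of the kind mentioned above (glu-glu-ala-lys, lys-ala-lys-leu) can be located with a simple scan over every window of a chosen length. The sketch below is illustrative only: the function name and the `toy` sequence are hypothetical, and the toy sequence is not the PspA sequence, so the positions it reports do not correspond to those in Figure 1.

```python
from collections import defaultdict
from typing import Dict, List

def repeated_motifs(seq: str, k: int = 4) -> Dict[str, List[int]]:
    """Map each k-residue motif that occurs more than once to its 1-based start positions."""
    positions: Dict[str, List[int]] = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i + 1)
    return {motif: starts for motif, starts in positions.items() if len(starts) > 1}

# Hypothetical one-letter-code sequence containing EEAK (glu-glu-ala-lys)
# and KAKL (lys-ala-lys-leu) twice each; NOT the real PspA sequence.
toy = "AEEAKTTEEAKLKAKLQQKAKL"

print(repeated_motifs(toy))  # {'EEAK': [2, 8], 'KAKL': [13, 19]}
```

Applied to the mature-protein sequence of Figure 1, the same scan would list every short repeat that could in principle give rise to a second, cross-reacting site.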

One mechanism that may account for the lack of exposure of epitopes between amino acids 115 and 192
would be a folding back of this portion of the α-helical sequence on itself or on other parts of PspA to form a
coiled-coil structure more complex than a simple coiled-coil dimer. If this occurred, it could explain how PspA tertiary
structure can sometimes be dependent on distant PspA structures. A suggestion that this might, in fact, be
the case comes from the observation that some of the truncated forms did not express certain epitopes under
physiologic conditions that were detected on the whole molecule under the same conditions and were shown
to be present in the fragment after denaturation in SDS.

Since a PspA vaccine may need to contain fragments of several serologically different PspAs, it would be
desirable to include in a vaccine only those portions of each PspA that are most likely to elicit cross-protective
antibodies. Based on the results presented herein with Rx1 PspA, it appears likely that the portion of the PspA
sequences corresponding to residues 192 to 260 of Rx1 PspA is the best portion of PspA to include in a recombinant
PspA vaccine. The epitopes in this portion of PspA were three and a half times as likely to be present
in the PspAs of other strains as the epitopes in the residue 1 to 115 portion of the sequence, and none of the
nine antibodies studied clearly reacted with the middle third of the α-helical region.

Example 8 : 

This Example shows protection of mice by PspA fragments. Five mice were immunized with purified fragment
produced by pBC207 in E. coli and five with purified fragment produced by pBC100 in E. coli. In both
cases, the fragments were injected in Freund's complete adjuvant. All mice immunized with each fragment
survived challenge with 100 x LD50 of WU2 capsular type 3 S. pneumoniae.

Five additional mice were injected with adjuvant plus an equivalent preparation of non-PspA-producing
E. coli. All mice died when challenged with the same dose of WU2.
The data presented in this Example conclusively prove that epitopes C-terminal to amino acids 119 and
192, respectively, are capable of eliciting protective immunity. This result is consistent with the findings presented
in the earlier Examples that the region of PspA from amino acids 192 to 260 contains protection-eliciting
epitopes.

SEQUENCE IDENTIFICATIONS

DNA sequence for pspA gene (Figure 1)
Deduced amino acid sequence for PspA protein (Figure 1)
Nucleotide sequence for PCR primer LSM 2
Nucleotide sequence for PCR primer LSM 3
Nucleotide sequence for PCR primer LSM 4

In summary of this disclosure, the present invention provides a PspA protein fragment which contains
protection-eliciting epitopes and which is cross-reactive and can be incorporated into a vaccine against disease
caused by pneumococcal infection. Modifications are possible within the scope of this invention.

The term "effectively homologous" used herein means, in relation to an amino acid sequence effectively
homologous to a defined sequence, that the said amino acid sequence may not be identical to said defined
sequence but may be at least 70 percent, more preferably 80 percent, still more preferably 90 percent identical,
provided that the antigenic epitope or epitopes in said amino acid sequence have properties substantially the
same as the corresponding epitopes in said defined sequence.
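The percentage-identity part of this definition can be illustrated with a short calculation. The sketch below assumes the two sequences are already aligned, of equal length and free of gaps; the function names and the ten-residue example sequences are hypothetical. Note that identity alone does not establish effective homology, since the definition also requires the epitopes to have substantially the same properties.

```python
def percent_identity(candidate: str, reference: str) -> float:
    """Percentage of aligned positions at which the two sequences carry the same residue."""
    if not reference or len(candidate) != len(reference):
        raise ValueError("sequences must be pre-aligned, non-empty and of equal length")
    matches = sum(a == b for a, b in zip(candidate, reference))
    return 100.0 * matches / len(reference)

def identity_tier(pct: float) -> str:
    """Classify an identity value against the 70 / 80 / 90 percent thresholds."""
    if pct >= 90:
        return "at least 90 percent"
    if pct >= 80:
        return "at least 80 percent"
    if pct >= 70:
        return "at least 70 percent"
    return "below 70 percent"

# Hypothetical ten-residue sequences in one-letter code.
reference = "AEEAKQKVDA"
print(percent_identity("AEEAKQKLDA", reference))                 # 90.0
print(identity_tier(percent_identity("AEEAKQRLDS", reference)))  # at least 70 percent
```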

Now that the region constituted by residues 192 to 260 in the PspA protein of the Rx1 strain has been
identified, those skilled in the art will readily be able to produce by recombinant techniques protein fragments
according to the invention. In particular, they may tailor DNA probes to use in a PCR reaction to amplify
genomic DNA coding for a desired fragment, insert the amplified DNA into a suitable plasmid vector and utilise
the vector in a known manner to express the protein in a suitable host such as E. coli, adapting the methods
taught in Example 3 above.

It will be possible to clone the appropriate pspA fragments and express their truncated products
under the control of an appropriate promoter, e.g. a vector containing the E. coli lac promoter expressing
the E. coli ompA and leader sequence to create an ompA::pspA fusion plasmid. Optionally, the sequence coding
for the PspA fragment may be linked to a sequence coding for a further protein suitable for injection into
humans. Such proteins would likely be those already used as vaccines because they are known to elicit protective
immune responses and/or known to function as strong immunologic carriers. Such proteins could include
the partial or complete amino acid sequence of toxins such as tetanus toxin, or outer membrane proteins
such as that of group B subtype 2 Neisseria meningitidis.

It will also be possible to produce a fusion protein composed of the cross-reactive protection-eliciting regions
of several different PspA molecules. Such a fusion protein could be made large enough (≥40,000 molecular
weight) to be highly immunogenic and as a single protein could elicit cross-protection to as many different
pneumococci as possible. The combination of cross-protective 70-amino-acid regions from 5 to 6 PspAs
would be large enough to be highly immunogenic. Constructs expressing epitopes from more than one PspA
are especially attractive since PspAs of pneumococci are known to differ serologically. Present evidence indicates
that a widely protective vaccine will need to contain cross-reactive protection-eliciting epitopes from
more than one different pneumococcus.
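As a rough check on the size argument, the molecular weight of a fusion of five or six 70-residue regions can be estimated. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb that is not taken from this document; the actual mass would depend on the composition of the regions chosen and on any linker or carrier sequences.

```python
AVG_RESIDUE_MASS_DA = 110  # rule-of-thumb average residue mass; an assumption
REGION_LENGTH = 70         # residues per cross-protective region, as stated in the text

def fusion_mass_da(n_regions: int, region_length: int = REGION_LENGTH) -> int:
    """Approximate mass in daltons of n_regions tandem regions."""
    return n_regions * region_length * AVG_RESIDUE_MASS_DA

for n in (5, 6):
    print(n, fusion_mass_da(n))
# 5 38500
# 6 46200
```

Under this assumption, five to six regions give roughly 38,500 to 46,200 daltons, which is in line with the figure of about 40,000 mentioned in the text.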

It is possible to design such a fusion protein so that it also carries a domain that would help with isolation
by including the choline binding region of PspA, or a ligand binding domain from other proteins (such as the
maltose binding protein [encoded by malE] of E. coli). In the former case the fusion protein could be isolated
by adsorption to a choline-Sepharose column and elution using 2% choline chloride. In the latter case adsorption
would be to a mannose-Sepharose column followed by elution with a solution containing mannose.

In the construction of such a fusion protein containing tandem cross-reactive coiled-coil PspA regions, it
will be critical not only that the appropriate open reading frame of each downstream gene fragment be preserved
at the junctions of the ligated gene fragments, but also that the heptad motif of the coiled-coil amino acid
sequence not be disrupted. One way to accomplish the latter would be to construct the gene fusions so that
they occur within naturally occurring non-coiled-coil regions found in the α-helical domain of PspA. In our previous
report (Yother and Briles, J. Bact. 174:601-609) such non-coiled-coil breaks were identified at amino acid
positions 169-176, 199, 225, 254, 274 and 289. Fusions between two or more cross-protective regions (residues
192-260) at or near positions 170 or 199 at one end and at or near residues 274 or 289 at the other end
would be expected to express the epitopes normally expressed within the coiled-coil regions.

In each case, the easiest way to prepare such constructs would be by PCR amplification of the DNA used
to construct the gene fusions. In this way it will be possible to prepare the relevant sequence with appropriate
restriction sites. The design of gene fusions and the PCR primers used to produce the individual pspA fragments
will be carried out so that the proper reading frame will be preserved in each fused gene fragment at
the nucleotide level.
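Two of the checks implied above, that a primer carries a restriction site and that a fused segment is a whole number of codons, can be sketched as follows. The primer sequences are those of SEQ ID NO:3 to NO:5 in the sequence listing. The choice of enzymes to scan for (BamHI, GGATCC; SalI, GTCGAC) and the helper names are assumptions made for illustration, and the sketch does not model the vector junctions themselves.

```python
from typing import Dict

# Primer sequences copied from SEQ ID NO:3 to NO:5 of the sequence listing.
PRIMERS: Dict[str, str] = {
    "SEQ ID NO:3": "GCGCGTCGACGGCTTAAACCCATTCACCATTGG",
    "SEQ ID NO:4": "CCGGATCCTGAGCCAGAGCAGTTGGCTG",
    "SEQ ID NO:5": "CCGGATCCGCTCAAAGAGATTGATGAGTCTG",
}

# Standard recognition sequences; which enzymes to scan for is an assumption.
SITES: Dict[str, str] = {"BamHI": "GGATCC", "SalI": "GTCGAC"}

def find_sites(primer: str) -> Dict[str, int]:
    """Return {enzyme: 0-based index} for each recognition site present in the primer."""
    return {name: primer.find(site) for name, site in SITES.items() if site in primer}

def keeps_frame(n_nucleotides: int) -> bool:
    """A fused segment preserves the downstream frame only if it is a whole number of codons."""
    return n_nucleotides % 3 == 0

site_map = {name: find_sites(seq) for name, seq in PRIMERS.items()}
print(site_map)

# Residues 192 to 260 inclusive span 69 codons, i.e. 207 nucleotides.
print(keeps_frame(3 * (260 - 192 + 1)))
```

In practice the frame also depends on how many nucleotides the restriction site and any linker contribute at each junction, so the same divisibility check would be applied to the complete fused insert.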

It is also possible to synthesise peptides according to the invention having the appropriate amino acid
sequence by conventional peptide synthesis.

SEQUENCE LISTINGS

(1) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2085 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Streptococcus pneumoniae
(B) STRAIN: Rx1

(vii) IMMEDIATE SOURCE:
(B) CLONE: JY4313

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 1..2085

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: join(127..1984)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
AAGCTTATGA TATAGAAATT TGTAACAAAA ATGTAATATA AAACACTTGA CAAATATTTA 60

CGGAGGAGGC TTATACTTAA TATAAGTATA GTCTGAAAAT GACTATCAGA AAAGAGGTAA 120

ATTTAG ATG AAT AAG AAA AAA ATG ATT TTA ACA AGT CTA GCC AGC GTC 168
Met Asn Lys Lys Lys Met Ile Leu Thr Ser Leu Ala Ser Val
1 5 10

GCT ATC TTA GGG GCT GGT TTT GTT GCG TCT CAG CCT ACT GTT GTA AGA 216
Ala Ile Leu Gly Ala Gly Phe Val Ala Ser Gln Pro Thr Val Val Arg
15 20 25 30

I CT CCC GTA GCC AGT TCT GCT GAG AAA GAC TAT 264
Ala Glu Glu Ser Pro Val Ala Ser Gln Ser Lys Ala Glu Lys Asp Tyr
35 40 45

GAT GCA GCG AAG AAA GAT GCT AAG AAT GCG AAA AAA GCA GTA GAA GAT 312
Asp Ala Ala Lys Lys Asp Ala Lys Asn Ala Lys Lys Ala Val Glu Asp
50 55 60

GCT CAA AAG GCT TTA GAT GAT GCA AAA GCT GCT CAG AAA AAA TAT GAC 360
Ala Gln Lys Ala Leu Asp Asp Ala Lys Ala Ala Gln Lys Lys Tyr Asp
65 70 75

GAG GAT CAG AAG AAA ACT GAG GAG AAA GCC GCG CTA GAA AAA GCA GCG 408
Glu Asp Gln Lys Lys Thr Glu Glu Lys Ala Ala Leu Glu Lys Ala Ala
80 85 90

TCT GAA GAG ATG GAT AAG GCA GTG GCA GCA GTT CAA CAA GCG TAT CTA 456
Ser Glu Glu Met Asp Lys Ala Val Ala Ala Val Gln Gln Ala Tyr Leu
95 100 105 110

111 ^ * f 1 * -u* GAC ^ GCC GCA AAA GAC ^ GCA GAT AAG 504
Ala Tyr Gln Gln Ala Thr Asp Lys Ala Ala Lys Asp Ala Ala Asp Lys
115 120 125

m2 ?lt ^ ^ *** CGC GAA GAA 071(3 GCA MA ACT AAA TTT 552
Met Ile Asp Glu Ala Lys Lys Arg Glu Glu Glu Ala Lys Thr Lys Phe
130 135 140

G J? £GA GCA ATG GTA GTT CCT GAG CCA GAG CAG TTG GCT GAG 600
Asn Thr Val Arg Ala Met Val Val Pro Glu Pro Glu Gln Leu Ala Glu
145 150 155

ACT AAG AAA AAA TCA GAA GAA GCT AAA CAA AAA GCA CCA GAA CTT ACT 648
Thr Lys Lys Lys Ser Glu Glu Ala Lys Gln Lys Ala Pro Glu Leu Thr
160 165 170

AAA AAA CTA GAA GAA GCT AAA GCA AAA TTA GAA GAG GCT GAG AAA AAA 696
Lys Lys Leu Glu Glu Ala Lys Ala Lys Leu Glu Glu Ala Glu Lys Lys
175 180 185

GCT ACT GAA GCC AAA CAA AAA GTG GAT GCT GAA GAA GTC GCT CCT CAA 744
Ala Thr Glu Ala Lys Gln Lys Val Asp Ala Glu Glu Val Ala Pro Gln
195 200 205

GCT AAA ATC GCT GAA TTG GAA AAT CAA GTT CAT AGA CTA GAA CAA GAG 792
Ala Lys Ile Ala Glu Leu Glu Asn Gln Val His Arg Leu Glu Gln Glu
210 215 220

CTC AAA GAG ATT GAT GAG TCT GAA TCA GAA GAT TAT GCT AAA GAA GGT
Leu Lys Glu Ile Asp Glu Ser Glu Ser Glu Asp Tyr Ala Lys Glu Gly
225 230 235

TTC CGT GCT CCT CTT CAA TCT AAA TTG GAT GCC AAA AAA GCT AAA CTA
Phe Arg Ala Pro Leu Gln Ser Lys Leu Asp Ala Lys Lys Ala Lys Leu
240 245 250

TCA AAA CTT GAA GAG TTA AGT GAT AAG ATT GAT GAG TTA GAC GCT GAA
Ser Lys Leu Glu Glu Leu Ser Asp Lys Ile Asp Glu Leu Asp Ala Glu
260 265 270

ATT GCA AAA CTT GAA GAT CAA CTT AAA GCT GCT GAA GAA AAC AAT AAT
Ile Ala Lys Leu Glu Asp Gln Leu Lys Ala Ala Glu Glu Asn Asn Asn
275 280 285

GTA GAA GAC TAC TTT AAA GAA GGT TTA GAG AAA ACT ATT GCT GCT AAA
Val Glu Asp Tyr Phe Lys Glu Gly Leu Glu Lys Thr Ile Ala Ala Lys
290 295 300

AAA GCT GAA TTA GAA AAA ACT GAA GCT GAC CTT AAG AAA GCA GTT AAT
Lys Ala Glu Leu Glu Lys Thr Glu Ala Asp Leu Lys Lys Ala Val Asn
305 310 315

GAG CCA GAA AAA CCA GCT CCA GCT CCA GAA ACT CCA GCC CCA GAA GCA
Glu Pro Glu Lys Pro Ala Pro Ala Pro Glu Thr Pro Ala Pro Glu Ala
320 325 330

A?I ^ CCA CCA GCG CCG GCT CCT CAA CCA GCT CCC GCA 

Pro Ala Glu Gin Pro Lys Pro Ala Pro Ala Pro Gin Pro Ala Pro Ala 
5 340 345 350 

Pr^ ^ ^ G ^ CCA G ? T GAA CCA AAA CCA GAA AAA ACA GAT 

Pro Lys Pro Glu Lys Pro Ala Glu Gin Pro Lys Pro Glu Lys Thr Asp 
355 360 365 

Asn S£ ^ ^ ^ GAC TAT GCT CGT AGA TCA GAA GAA GAA TAT 

10 Asp Gin Gin Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Glu Glu Tyr 

370 375 380 

AAT CGC TTG ACT CAA CAG CAA CCG CCA AAA GCT GAA AAA CCA GCT CCT 
Asn Arg Leu Thr Gin Gin Gin Pro Pro Lys Ala Glu Lys Pro Ala Pro 
385 390 395 

15 ^ AAA ACA GGG TGG AAA CAA GAA AAC GGT ATG TGG TAC TTC TAC 

LyS Thr Gly Tr P ^ Gln Glu Asn Gly Met Trp Tyr Phe Tyr 
400 405 410 

££n t£I 2? T 1°* ATG GCG ACA GGA TGG CTC °AA AAC AAC GGT TCA 

Asn Thr Asp Gly Ser Met Ala Thr Gly Trp Leu Gin Asn Asn Gly Ser 

20 4 15 ' 42 <> 425 430 

TGG TAC TAC CTC AAC AGC AAT GGT GCT ATG GCT ACA GGT TGG CTC CAA 
Trp Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gin 
435 440 445 

TAC AAT GGT TCA TGG TAT TAC CTC AAC GCT AAC GGC GCT ATG GCA ACA 
Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr 
450 455 460 

GGT TGG GCT AAA GTC AAC GGT TCA TGG TAC TAC CTC AAC GCT AAT GGT 
Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly 
465 - 470 475 J 

GCT ATG GCT ACA GGT TGG CTC CAA TAC AAC GGT TCA TGG TAT TAC CTC 

!?on r Gly Trp Leu Gln Asn Gly Ser Trp Tyr Tyr Leu 

480 485 490 



AAC GCT AAC GGC GCT ATG GCA ACA GGT TGG GCT AAA GTC AAC GGT TCA 
Asn Ala Asn Gly Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gly Ser 
495 500 505 510 

TGG TAC TAC CTC AAC GCT AAT GGT GCT ATG GCT ACA GGT TGG CTC CAA 
Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp Leu Gln 
515 520 525 

TAC AAC GGT TCA TGG TAC TAC CTC AAC GCT AAC GGT GCT ATG GCT ACA 
Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr 
530 535 540 

GGT TGG GCT AAA GTC AAC GGT TCA TGG TAC TAC CTC AAC GCT AAT GGT 
Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly 
45 545 550 555 

GCT ATG GCA ACA GGT TGG GTG AAA GAT GGA GAT ACC TGG TAC TAT CTT 

Ala Met Ala Thr Gly Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Leu 
560 565 570 

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 619 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

Met Asn Lys Lys Lys Met He Leu Thr Ser Leu Ala Ser Val Ala He 

5 10 15 

Leu Gly Ala Gly Phe Val Ala Ser Gin Pro Thr Val Val Arg Ala Glu 
30 ^° 25 30 

Glu ser Pro Val Ala Ser Gin Ser .Lys Ala Glu Lys Asp Tyr Asp Ala 

40 45 

Ala Lys Ly S Asp Ala Lys Asn Ala Lys Lys Ala Val Glu Asp Ala Gin 

55 60 

Lys Ala Leu Asp Asp Ala Lys Ala Ala Gin Lys Lys Tyr Asp Glu Asp 

70 75 80 

Gin Lys Lys Thr Glu Glu Lys Ala Ala Leu Glu Lys Ala Ala Ser Glu 

5 90 95 

Glu Mec Asp Lys Ala Val Ala Ala Val Gin Gin Ala Tyr Leu Ala Tyr 
100 ios 110 

Gin Gin Ala Thr Asp Lys Ala Ala Lys Asp Ala Ala Asp Lys Met He 
Xli > 120 12 5 

Asp Glu Ala Lys Lys Arg Glu Glu Glu Ala Lys Thr Lys Phe Asn Thr. 
"° 135 140 

14 5 ^ Ala Met Val Pro Glu Pro Glu Gin Leu Ala Glu Thr Lys 

150 15S 160 



1896 



Si S S SJ S K 2» S £ ffi S SS JS 12 £ 

- 585 - 590 

s: s s ™ ss s ss k s s s s s ss s s - 

SHb 600 605 

S £ S5 SS.gi SS-JS S IS-S5 Si £ SI T ^ occ °" 

615 

TAA ATT AAA GCA TGT TAA GAA CAT TTG ACA TTT TAA TTT TGA AAC AAA 
GAT AAG GTT CGA TTG AAT AGA TTT ATG TTC GTA TTC TTT AGG TAC 



2040 
2085 



Lys Lys Ser Glu Glu Ala Lys Gln Lys Ala Pro Glu Leu Thr Lys Lys
165 170 175 

Leu Glu Glu Ala Lys Ala Lys Leu Glu Glu Ala Glu Lys Lys Ala Thr 
5 180 185 190 

Glu Ala Lys Gin Lys Val Asp Ala Glu Glu Val Ala Pro Gin Ala Lys 
195 200 205 

He Ala Glu Leu Glu Asn Gin Val His Arg Leu Glu Gin Glu Leu Lys 
10 210 215 220 

Glu He Asp Glu Ser Glu Ser Glu Asp Tyr Ala Lys Glu Gly Phe Arg 
225 230 235 240 

Ala Pro Leu Gin Ser Lys Leu Asp Ala Lys Lys Ala Lys Leu Ser Lys 
15 245 250 255 

Leu Glu Glu Leu Ser Asp Lys lie Asp Glu Leu Asp Ala Glu He Ala 
260 265 270 

Lys Leu Glu Asp Gin Leu Lys Ala Ala Glu Glu Asn Asn Asn Val Glu 
20 275 280 285 

jF£ r Phe L Y S Gly Leu Glu Lys Thr He Ala Ala Lys Lys Ala 

290 295 300 

Glu Leu Glu Lys Thr Glu Ala Asp Leu Lys Lys Ala Val Asn Glu Pro 
25 305 310 315 320 

Glu Lys Pro Ala Pro Ala Pro Glu Thr Pro Ala Pro Glu Ala Pro Ala 
325 330 335 

Glu Gin Pro Lys Pro Ala Pro Ala Pro Gin Pro Ala Pro Ala Pro Lys 
30 340 " " 345 350 

Pro Glu Lys Pro Ala Glu Gin Pro Lys Pro Glu Lys Thr Asp Asp Gin 
355 360 365 

Gin Ala Glu Glu Asp Tyr Ala Arg Arg Ser Glu Glu Glu Tyr Asn too 
35 370 . 375 380 

Leu Thr Gin Gin Gin Pro Pro Lys Ala Glu Lys Pro Ala Pro Ala Pro 
385 390 395 400 

Lys Thr Gly Trp Lys Gin Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr 
40 405 410 415 

Asp Gly Ser Met Ala Thr Gly Trp Leu Gin Asn Asn Gly Ser Trp Tyr 
420 425 430 

Tyr Leu Asn Ser Asn Gly Ala Met Ala Thr Gly Trp Leu Gin Tyr Asn 
45 435 440 445 

Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp 
450 455 460 

Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met 
465 470 475 480

Ala Thr Gly Trp Leu Gin Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala 

85 490 495 

Asn Gly Ala Met Ala Thr Gly Trp Ala Lys Val Asn Gly Ser Trp Tyr 

Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp Leu Gin Tyr Asn 

3J -=> 520 



525 



Gly ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp 

535 540 
Ala Lys Val Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met 

550 555 5 6 o 

Ala Thr Gly Trp Val Lys Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala 
565 570 575 

ser Gly Ala Met Lys Ala Ser Gin Trp Phe Lys Val Ser Asp Lys Trp 



20 !. J f| Asn G1 y Leu G1 y Leu Ala Val Asn Thr Thr Val Asp 



585 59 0 

Thr 
605 



GlY 610 LyS Val Asn Ala Asn G1 y Glu Trp Val 



615 



(3) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

GCGCGTCGAC GGCTTAAACC CATTCACCAT TGG 33

(4) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

CCGGATCCTG AGCCAGAGCA GTTGGCTG 28

(5) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 31 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

CCGGATCCGC TCAAAGAGAT TGATGAGTCT G 31




Claims

1. An isolated pneumococcal surface protein A (PspA) protein fragment comprising amino acid residues corresponding
to all or some of amino acid residues 192 to 260 of the PspA protein of the Rx1 strain of Streptococcus
pneumoniae containing at least one protection-eliciting epitope and optionally up to a further 25
residues of said protein in the NH2-terminal direction and/or the COOH-terminal direction, or being effectively
homologous with such a protein fragment.

2. A protein fragment as claimed in Claim 1, containing an amino acid sequence corresponding to amino
acid residues 192 to 260 of the PspA protein of the Rx1 strain.

3. A protein fragment as claimed in Claim 2, having said amino acid sequence.

4. A protein fragment as claimed in Claim 1, 2 or 3 containing an amino acid sequence effectively homologous
to the amino acid residues 192 to 260 of the PspA protein of said Rx1 strain.

5. A protein fragment as claimed in Claim 4, constituted by an amino acid sequence effectively homologous
to the amino acid residues 192 to 260 of the PspA protein of said Rx1 strain.

6. A protein fragment as claimed in any one of Claims 1 to 5 which is produced recombinantly.

7. An isolated protein fragment comprising the amino acid sequence of or effectively homologous with that
of a protection-eliciting epitope corresponding to an epitope contained in amino acid residues 192 to 260
of the pneumococcal surface protein A (PspA) protein of the Rx1 strain of Streptococcus pneumoniae,
and including no more than 25 additional amino acid residues in the NH2- and/or the COOH-terminal direction.



8. A pneumococcal surface protein A (PspA) protein fragment comprising a plurality of conjugated molecules,
each molecule comprising an isolated protein fragment corresponding to or effectively homologous with
a protection-eliciting epitope corresponding to an epitope located in residues 192 to 260 of the PspA of
strain Rx1, molecules within said plurality optionally being derived from different strains of S. pneumoniae.

9. A vaccine against disease caused by pneumococcal infection, comprising, as an immunologically-active
component, a PspA protein fragment as claimed in any one of Claims 1 to 8.

10. A vaccine as claimed in Claim 9, characterised in that said PspA protein fragment is as claimed in any
one of Claims 1 to 7 and is conjugated to a larger molecule.

11. A DNA primer or probe having the nucleotide sequence:-

5'-CCGGATCCTGAGCCAGAGCAGTTGGCTG-3'

12. A DNA primer or probe having the nucleotide sequence:-

5'-CCGGATCCGCTCAAAGAGATTGATGAGTCTG-3'